Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Formal Structure of Sanskrit Text: Requirements Analysis for a Mechanical Sanskrit Processor

Identifieur interne : 000830 ( Main/Exploration ); précédent : 000829; suivant : 000831

Formal Structure of Sanskrit Text: Requirements Analysis for a Mechanical Sanskrit Processor

Auteurs : Gérard Huet [France]

Source :

RBID : ISTEX:F0182F346F74AE86190F64F70588F0060134979E

Abstract

Abstract: We discuss the mathematical structure of various levels of representation of Sanskrit text in order to guide the design of computer aids aiming at useful processing of the digitalised Sanskrit corpus. Two main levels are identified, respectively called the linear and functional level. The design space of these two levels is sketched, and the computational implications of the main design choices are discussed. Current solutions to the problems of mechanical segmentation, tagging, and parsing of Sanskrit text are briefly surveyed in this light. An analysis of the requirements of relevant linguistic resources is provided, in view of justifying standards allowing inter-operability of computer tools. This paper does not attempt to provide definitive solutions to the representation of Sanskrit at the various levels. It should rather be considered as a survey of various choices, allowing an open discussion of such issues in a formally precise general framework.

Url:
DOI: 10.1007/978-3-642-00155-0_6


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct:series">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Formal Structure of Sanskrit Text: Requirements Analysis for a Mechanical Sanskrit Processor</title>
<author>
<name sortKey="Huet, Gerard" sort="Huet, Gerard" uniqKey="Huet G" first="Gérard" last="Huet">Gérard Huet</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:F0182F346F74AE86190F64F70588F0060134979E</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1007/978-3-642-00155-0_6</idno>
<idno type="url">https://api.istex.fr/document/F0182F346F74AE86190F64F70588F0060134979E/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000681</idno>
<idno type="wicri:Area/Istex/Curation">000673</idno>
<idno type="wicri:Area/Istex/Checkpoint">000352</idno>
<idno type="wicri:doubleKey">0302-9743:2009:Huet G:formal:structure:of</idno>
<idno type="wicri:Area/Main/Merge">000838</idno>
<idno type="wicri:Area/Main/Curation">000830</idno>
<idno type="wicri:Area/Main/Exploration">000830</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Formal Structure of Sanskrit Text: Requirements Analysis for a Mechanical Sanskrit Processor</title>
<author>
<name sortKey="Huet, Gerard" sort="Huet, Gerard" uniqKey="Huet G" first="Gérard" last="Huet">Gérard Huet</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>INRIA Rocquencourt, BP 105, 78153, Le Chesnay Cedex</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Île-de-France</region>
<settlement type="city">Le Chesnay</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2009</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">F0182F346F74AE86190F64F70588F0060134979E</idno>
<idno type="DOI">10.1007/978-3-642-00155-0_6</idno>
<idno type="ChapterID">6</idno>
<idno type="ChapterID">Chap6</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: We discuss the mathematical structure of various levels of representation of Sanskrit text in order to guide the design of computer aids aiming at useful processing of the digitalised Sanskrit corpus. Two main levels are identified, respectively called the linear and functional level. The design space of these two levels is sketched, and the computational implications of the main design choices are discussed. Current solutions to the problems of mechanical segmentation, tagging, and parsing of Sanskrit text are briefly surveyed in this light. An analysis of the requirements of relevant linguistic resources is provided, in view of justifying standards allowing inter-operability of computer tools. This paper does not attempt to provide definitive solutions to the representation of Sanskrit at the various levels. It should rather be considered as a survey of various choices, allowing an open discussion of such issues in a formally precise general framework.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Île-de-France</li>
</region>
<settlement>
<li>Le Chesnay</li>
</settlement>
</list>
<tree>
<country name="France">
<region name="Île-de-France">
<name sortKey="Huet, Gerard" sort="Huet, Gerard" uniqKey="Huet G" first="Gérard" last="Huet">Gérard Huet</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000830 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000830 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:F0182F346F74AE86190F64F70588F0060134979E
   |texte=   Formal Structure of Sanskrit Text: Requirements Analysis for a Mechanical Sanskrit Processor
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024